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ABSTRACT 

Discussed is the theoretical and practical inportance 
of the developnent of learning and transfer tazononies vith 
psychoaetric relevance and of the building of psychonetric 
classificatory systens vith inplications for learning and 
instruction. Psychoaetric classifications of Hunan performances aost 
often are based on the covariation of individual differences. A nodel 
is presented vhich justifies the expectation that the transfer fron 
learning one task to learning another is linearly dependent on the 
coefficient of intercorrelation between the two tasks when the 
coefficient is corrected for attenuation. Two studies so far have 
explicitly confirmed the nain deductions from this nodel. Contrary to 
the predictions, hovever, the regression curves yielded negative 
intercepts. Tvo enpirically testable explanations are offered, one of 
vhich vould be in full accordance vith the model, vhile the other 
vould call for a further assumption. (Author) 
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Statement of Focus 



Individually Guided Education (ICE) is a new comprehensive system of 
elementary education. The following components of the IGE system are in 
varying stages of development and implementation: a riew .organization for 
instruction and related administrative arrangements; i model of instructional 
programing for the individual student; and curr^iculum cc uponents in prereading, 
Tcadinq, mathematics,' motivation, and enviro«*imentaJ education. The develop- 
ment of other curriculum components, of a system fur managing instruction by 
computer, and of instructional strategies is needed to complete the system. 
Continuing programmatic research is required to provide a sound knowledge 
base for the components under development and for improved second generation 
components . Finally, systematic implementation is essential so that the prod- 
ucts will function properly in the IGE schools. 

The Cerfor plans and carries out the research, development, and imple- 
mentation c ponent.s of its IGE program in this sequence: (1) identify the 
needs and delimit the co;; onent problem area; (2) assess the possible con- 
straints—financial resources and availability of staff; (3) formulate general 
plans and specific proce ii::rcs for solving the problems; (4) secure and allo- 
cate human and material resources to carry out the plans; (5) provide for 
effective communir tinong personnel and efficient management of activi- 

ties and resources; anr: ) evaluate the effectiveness of each activity and 
its contribution to tro 1 program and correct any difficulties through feed- 
back mechanisms and appi jpriate management techniques. 

A self-renewing system of elementary education is projected in each 
participating elementary school, i.e., one which is less dependent on external 
sources for direction and is more res pons iv^ to the needs of the children attend- 
ing each particular school. In the IGE schools, Center-developed and other 
curriculum products compatible with the Center's instructional programing model 
will lead to higher student achievement and self-direction in learning and in 
conduct and also to higher morale and job satisfaction among educational per- 
sonnel. Each developmental product makes its unique contribution to IGE as 
it is implemented in the schools. The various research components add to the 
knowledge of Center practitioners, developers, and theorists. 
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Abstract 



The theoretical and practical importance of the development 
of learning and transfer taxonomies with psychometric relevance 
and of the building of psychometric classificatory systems 
with implications for Learning and instruction is discussed. 
Psychometric classifications of human performances most often 
are based on the covariation of individual differences. A model 
is presented which justifies the expectation that the transfer 
from learning one task to learning another is linearly dependent 
on the coefficient of intercorrelation between the two tasks 
when the coefficient is corrected for attenuation. Two studies 
so far have explicitly confirmed the main deductions from this 
model. Contrary to the predictions, however, the regression 
curves yielded negative intercepts. Two empirically testable 
explanations are offered, one of which would be in full accordance 
with the model, while the other would call for a further assumption. 
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Problem 



Behavioral performance relies heavily on 
prior learning. Such learning often affects 
different kinds of performance at the same time, 
as a learning activity is not usually undertaken 
with the intention to produce just one "discrete" 
performance in one "discrete" situation. This 
calls for transfer. If a learning activity Is 
really to be effective, its result has to be 
transferable, since every performance situation 
differs in some manner from every other. 

Although the reality of the transfer phenom- 
enon is undeniable, there se^ms not to be trans- 
fer to such aa extent and to such a variety of 
performances as many curriculum planners at 
one time hoped and some still may hope. More- 
over, the amount of transfer is not adequately 
distributed over a dichotomous variable; i.e. , 
from one learning activity (A) to another or to 
any other performance (B) there might be marked 
transfer, but at the same time far less to 
another activity (C) , although there might be 
transfer to both. This circumstance indicates 
why taxonomies of learning or the many psycho- 
metric classificatory systems of behavioral 
performances are at the same time numerous 
and dissatisfactory. 

One might argue that in terms of learnable 
human performance--on which we narrow the 
further discussion — a psychometric taxonomy 
woui^ be of broadest theoretical and practical 
value when it would allow for accurate predic- 
tion of transfer of learning. Instead, to date, 
transfer prediction research has rarely cared 
for the psychometric search of an economic 
and meaningful system to classify psychometric 
variables and vice-versa. Much of the transfer 
research has been conducted on the basic or 
micro level, trying different kinds of task 
analyses and different sorts of "similarity" 
measures as predictors (Skaggs, 1925; Robinson, 
192 7; Gibson, 1940; Boring, 1941; Osgood, 
1949; Gagne, Baker, & Foster, 1950; Ahlstroem, 
1961; Houston, 1964, 1965; Dallett, 1965; 
ButoUo, '968; for an extensive literature review 
see Flammer, 19 70). This work has not allowed 



differential (psychometric) psychologists to 
make a connection to learning psychology. 
Some of the latest transfer-oriented taxonomic 
sy^stems (Gagn€, 1970, 1972; Klausmeier, 1971; 
Klausmeier, Ghatala, & Prayer, in press) could 
lend themselves better to psychometric "trans- 
lations." At least the hierarchic conceptualiza- 
zation of learning activities has already proven 
to bear heavy implications on some traditional 
views of individual differences in school learn- 
ing (Gagng & Paradise, 1962; Bloom, 1971; 
Plammer, 1973). 

Psychometric taxonomies are traditionally 
based on correlations, typically in the factor 
analytic framework. Although they try to pro- 
vide theoretical and generalizable insight 
(Spearman, 1923; Thurstone , 1938; Vernon , 
1950; Meili, 1961; Guilford, 1967; and others), 
they are often used more as devices for economic 
descriptions of a given set of variables or of a 
set of individual test results. Because of a 
long-standing lack of interest in the learnability 
of the ineasured abili^ties or performances, the 
question for the adequacy of the systems to 
the prediction of transfer has rarely arisen, 
and when it has, there seems not to have been 
any doubt that transfer would very much depend 
on the correlation (Ferguson, 1954, 1956; 
Bunderson, 1964, 1967; Dunham, Guilford, & 
Hoepfner, 1966; Klauer, 1969; Fleishman, 
1972). 



Early Studies 

A first empirical evaluation of the supposed 
connection was undertaken by Gengerelli (1934). 
He administered a code transcription task, first 
as a pretest, then as a posttest. In between, 
eight different groups of Ss were subjected to 
a three-minute interpolated activity, each 
group to another, varying in degree of similarity 
with the original activity. 

The amount of retroactive inhibition was 
compared with the squared — uncorrected for 
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attcnuaUon--coefficicnt of correlation. The 
retroactive inhibition was highest with high 
correlations and lowest with those in a middle 
range, increasing again as the correlation 
decreased. 

A more recent attempt to investigate the 
connection between transfer and correlation was 
undertaken by Heinonen (1962). He hypothesized 
that the transfer of learning from test A to test B 
would be a linear function of the degrees of 
the ^ngle between the two tests in the common 
fa^ tjr space. This angle was expected to re- ' 
fleet the "functional similarity" relevant to 
transfer prediction and would be obtained 
throug.) the equation 

2 

where h = factor analytic commonality of each 
test. Heinonen assumed positive and negative 
transfer to be symmetric in the sense thai with 
small angles there would be high positive or 
high negative transfer, depending on whether 
the kind of transfer was expected a priori to be 
positive or negative. 

Heinonen tested his hypothesis, though 
only the positive transfer case, in an experi- 
ment with sensorimotor tests. Ninety-one 
5th and 6th graders were presented with 16 
tests on the first experimental day. On the 
four following days, for five minutes each day, 
they were retested on one of the 16 tests. 
Thereafter the 16 tests were readministered. 
The mean raw gains, divided by the pretest 
standard deviation of each test, were compared 
with the angles in the common factor space. 
After checking a graphic representation, 
Heinonen stopped the analysis, realizing 
"that the relationship between the gain due to 
training and the factorial distance between the 
J training test and the other tests was not close. " 

With regard to the large, already existing 
, research body on factorial classification of 
performance variables, such a finding would 
be alarming if it could be substantiated by 
further investigations. There are, however, 
at least two critical points that could account 
for the failure of a clear relation. The first 
might be the relatively short training time 
(four five-minute sessions); the other is the 
dependability of Heinonen*s angle measure on 
the test sampling. The commonality is usually 
smaller than the reliability would allow it to 
be. If any test B happened to be a "factorial 
outsider, " hg^ would tend to be low and 
' '"AB^^^A^b) would tend to be high. The mere 

inclusion of a very similar test B' into the test 
battery would raise hg^ and therefore lower 
r^g/(h^hg), although the amount of transfer 
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per test v/ould not be expected to change. 

The correlation coefficient as such v.-oulJ 
not be affected by the composition of the 
eventual set of other tests. It would also be 
much more applicable, because it would not 
require the labor of administering a ratner 
large number of additional tests. 

Heinonen's hypothesis was subjected later 
to another experiment, done by Mclamecsi!< 
(1965). Melamets^ used typ,es of intellujence- 
test tasks and an experimental desiijn th<it 
was the same as Helnonen's, except that ho 
had. In addition, a control group working only 
on the pre- and posttests. The' training he 
provided centered on an oddity task (number 
group test) and contained 12 sessions of ten 
tasks each. 

Both the experimental and the control 
groups showed a significant mean change from 
pretest to posttest, but the experimental group 
exceeded the control group with statistical 
significance only in the learning task. Thus, 
further analysis of the relationship between 
transfer and cos 4) = t^^b/^^A^B^ ^° 
abandoned. 

Flammer (1970) hypothesized that t)he 
transfer would be a linear function of the 
squar'^d coefficient of correlation after c' erec- 
tion for attenuation, in order to protect Lhe 
sign of the correlation coefficient: 

I 

The correlation for attenuation was intended to 
correct for the bias by which different coeffi- 
cients are differently affected depending on 
the reliability of their variables. The division 
by the square root of the product of the reliability 
coefficients replaced the division by the square 
root of the product of the commonalities in the 
hypotheses by Heinonen and MelametsS. The 
argument for the squaring was to make the pre- 
dictor express the relative amount of predictable 
individual difference variance. 

An experiment was done including nine 
intelligence subtests selected from the BASC 
test battery (Cardinet & Rousson, 1967, 1968). 
These tests contained several series-continuation 
tasks, i.e., letter series, number series, geo- 
metric shape series, domino series, and date 
series, and some other tests like oddity prob- 
lems , letter matrices , homophones, and 
numerical distance determinations. 

The experimental design was the same as 
Helnonen*s, except that the training consisted 
of ten 30-minute 'sessions, each on a different 
day, and there was a control group which did 
only the pretests and the posttests. The 



experimental group consisted of 186 Ss and the 
control group of 235 Ss, all French-speaking 
5th and 6th grade Swiss boys and girls. The 
training consisted of a systematically guir^ed 
discovery of the principles underlyina ne kind 
of i:ask in the letter-series test. 

The trans ter per test was r.ieasured as mean 
raw gain of the experimental group minus the 
mean raw gain of the control group, divided by 
the pretest standard deviation. 

Table 1 contains the linear correlations 
rpc betwep.* the predictor correlations and the 
transfr: measures. The small number of transfer 
tests (eight) did not allow the smaller correlation 
coefficients to show up as statistically signifi- 
cant. The results may be summarized as follows; 
1 . There was a substantial linear compo- 
nent in the regression of the transfer 
per test on this correlation with the 
training test. 
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The correlations taken from the same 
group of Ss that also produced the 
transfer under investigation tended 
to be more predictive than those 
stemming from an unrelated group 
(cross-validation) . 
Posttest correlations tended to be 
more predictive than pretest correla- 
tions < as was the case in the cross- 
validation. 

Squaring the predictor correlation 
coefficients had little and ambiguous 
effect. 

The predictability increased after 
correction for attenuation « though 
very slightly. Since there also were 
only small differences in reliability, 
this effect could not show up more 
markedly. 



TABLE 1 

LINEAR CORRELATIONS Vjq BETWEEN TRANSFER 
MEASURES AND CORRELATIONS 



Predictors not corrected Predictors corrected 
f^r ^1.1. — — for attenuation 



for attenuation 



Predictors stemming from not squared squared not squared squared 



Exp. group/pretests 
Exp. group/postests 
Control group/pretests 
Control group/posttests 



.43 
.81* 
.29 
.48 



.44 
.81* 
.22 
.46 



.54 
.86* 
.29 
.51 



.57 
.86^ 
.19 
.49 



*p < .01 (two-tailed); if = 8 - 2 = 6 
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A Ccuceptiuri McnM 



The connection between transfer and corre- 
lation has hitherto been conceived of rather 
intuitively. Hammer (1970) argued that the 
covariation betweca tw« performances within 
a population is an expression of the extent to 
which the two performances are DsvchaioaicAWv 
or functionally the same or similar. Certainly, 
this correlational approach has the enormous 
advantage that it docs not require detailed task 
analyses, let alone a S-R decomposition, and 
that it makes the largo body of psychometric 
research available to learning theory and oven 
to a wide range of instructional application. 
But the approach also has the disadvantage that 
correlation coefficients are highly sensitive to 
subject sampling. 

The most important questions the author 
tries to clear up here are these: 

1 . How sho lid the link between the mere 
statistical correlation meacuro and 
the transfer of learning be conceived 
of more coherently or more psycho- 
logically ? 

2. Ir a more technical respect, but rele- 
vant because of the weakness of the 
conceptualization to date, is it ade- 
quate to square the correlation coeffi- 
cient or is It not? Regarding this, the 
empirical results so far are inconclusive. 
Are there stronger theoretical grounds 

to predict either the unsquared or the 
squared coefficient to be superior, i.e. , 
stronger than the argument by analogy 
that the transfer would be proportional 
to individual difference variance com- 
ponents? Since this question taps at 
the same time the incertitude concern- 
ing the adequate scaling of the transfer, 
one would like to base the scaling 
question on a common theoretical 
ground, too. 
The following model seems to have some 
ability to answer these questions. Assume a 
performance to be learned that consists of a 
number of elements whose extension corresponds 



to the average time required for a given popu- 
lation to master them. These elements are 
handled as hypothetical constructs and thought 
of as mutually exclusive and independent. 
Assume task X to be composed of such ele- 
ments and task Y of Oy elements. Let n^ be 
the number of elements which the given popula- 
tion of gs identifies correctly as being common 
to both tasks and n£ be the number of elements 
identified incorrectly as being common to both 
tasks. Both n^ and n£ need to be regarded as 
being ''identified'* only insofar as they are . 
learned . 

Concerning the transfer , we assume now 
that learning X (so-called original learning) 
facilitates later learning V or improves the 
performance Y to the extent that there are c- 
elements involved, which for their part have 
been affected by original learning, since they 
are included in X, too. If a proportion of k/n^ 
in X is learned (either k elements randomly 
selected and learned in the all-or-nonc fashion 
or all el4)ments learned to the degree k/n^), 
Y will profit to the extent n^ny. 



k^ n^ Oy • 

The analogue is true for elements falsely iden- 
tified as being common. These elements pro- 
duce a negative transfer component: 



k- n^ ny 
The net transfer is then 

n - n^ 

T.,»T.^-T. "k-^ . 

k k-^ k- n^n^ 

Concerning the product-moment correlati on 
between X and Y, we propose to retain out of 



the above proposed all-or-none and incremen- 
tal learning assumptions only the all-or-none 
one. Incidentally, if in the incrementai learn- 
ing case the elements are conceived of as ex- 
tremely small, the "error" introduced by dichot- 
omizing the degree of learning to the alternative 
"learned" and "not yet learned" approaches 
zero. And since these elements are mere hy- 
pothetical constructs, nothing hinders this 
assumption. 

The coefficient of the product-moment 
correlation between X and Y by definition is 
the following: 

^ ^ Gov CX.Y) 



\A/ar (X) Var (Y) 

From the above assumption of the unrelatedness 
of the elements. Gov (X,Y) Is In fact equal to 
the CO variance produced by the c-elements and 
the f-elements, i.e.: 



= n^d - v)(v(l - v) + v"") 

+ n£(v(l - v)(v- 1) +(1 - v)(-v)v) 
= n^v(l - v)(l - V + v) + n£v(l - v) (v - 1 - v) 
« n^^vd - v) -♦'njvd - v)(-l) 

Gov (X,Y) = (nc-nj)v(l - v) 

Since Var (X) = n^p(l - p) and Var (Y) = nyt(l - t) 
(n^- nf)v(l - v) 



P = 



V/n^pd - P)nyt(l - t) 



Define w = mln (p,t) , 
=> P 



) 

-w) 



For the special case that£.^_t. 



Gov (XY) = n^Gov (c,c') + n^Gov (f,f') . 

Since the correlation between two variables 
can never be different from zero If one or both 
variables have zero variance, a condition to 
evaluate a meaningful correlation coefficient 
ts that both learning tasks X and Y are already 
learned to a certain degree, but not completely. 
We therefore define p as the degree of mas- 
tery of X and t as the degree of mastery of Y 
(0<p<l, 0<t<l). Since for the c-elements 
and for the f-clements the probability to be 
mastered Is the same as the probability to be 
mastered for all those elements of that task 
that has the higher degree cf mastery, we 
define 

V = max (p,t) . 

Spelling out Gov(K,Y) produces: 

Gov (X.Y) = n^(v(l -v)(l -v) + (1 - v) (0 - v) (0 - v)) 



^ n£(v(l-v)(0-(l-v)) 
^ (1 -v)(0-v)(l -d -V))) 



(1) 



Here It Is assumed that all the f-elements 
are "right" *n the first task and "wrong" In the 
second task. The result would not be changed 
If the contrary were true or if there would be 
a mixture (some "right" In the first task and 
some In the second) . 



"c""f 



We remember 



^"y 



Substituting by p we get 

k 

T = P . 



This also makes Intuitive sense: The transfer 
is directly related to the amount k of so-called 
original learning, but It becomes small if 
either the original or the transfer task or both 
become "bigger," since we defined T from the 
beginning as the amount of facilitation or inter- 
ference relative to the amount of learning re- 
quired. 

If P/ t 



k /wd 



w) 



Adding the unity factor 



v) 



"v^w 



n n,, 
w V 



u^here = n^^, if max (p,t) = p 
= riy , if max (p,t) = t 
n^= n^, if min (p,t) = p 
= riy , if min (p,t) = 

gives 



= P 



' w n v(l - V 

n n 1 w W X 
X y ' ' 

k /n a 
n^ny Y w V 



w) 

r 



Since either {n^= and ny= n^) or {n^= 
and ny= n^) , 



k a 



(2) 



W V 



One important characteristic of this con-- 
ceptualization is that for a given learning time 
k, the transfer from the original learning to Y 
is the same as from Y to X. This is given by 
the definition of the amount of transfer relative 
to the amount of the total time needed to learn 
a given task completely and from zero. An 
intuitive reason for this is best demonstrated 



by some examples , arranged in Table 2 . One 
could paraphrase as follows: when n^ < Hy, 
X, with a given (n^ - n^) , produces a large 
"carryover" which in turn loses relative weight 
in the large Y. If, on t]\e other hand, n^^ > n 
with .the same (n^-n£), the carryover is smaller 
but its weight within Y is heavier. 

Table 2 shows that the transfer formula is 
also applicable to the special cases of "sir^ple 
learning" when either n^^ = n^, or ny = n^ , or 
n^^ = n^ = ny . 

Coming back to the question of whether 
the transfer is proportional to the squared or 
to the simple correlation coefficient, it is 
evident that within the proposed system of 
conceptualizing both the transfer and the corre- 
lation at once, only the unsquared correlation 
coefficient makes sense. Moreover, the rela- 
tion should hold even if p < 0.^^^ 

Concerning the width of application of the 
model a crucial question seems to be that of 
the definition of the task. The model itself re- 
quires the single c-elements and the f-elements 
to be learned at the same rate as all other elp- 
ments. This can theoretically be accomplished 
by having all single elements having the same 
probability to be sampled or in a case of a fixed 
sequence of elements by having the c-elements 
and the f-elements each arranged at equal in- 
tervals , i.e. , ^o/n^ and n£/n , respectively. 
Further research must determine how critical 
these conditions are and how their being met or 
not met can be decided objectively. 



(2) 

The interpretation of this result is 
complicatad, since growth of v increases cr^ 
as long as v < .5, but decreases 
Furthermore, it can happen that v > .5 and 
w < .5 , by which alone it could even not be 
decided whether (7^^/(7^ > 1 or cr^a^ < 1 or 



CT^ thereafter. 



1, 



(3) 

One might remember that Heinonen (1962) 
in fact expected negative transfer, provided it 
was negative for whatever reasons, to be higher 
(negative) the bigger the (cor-)relatlon as ex- 
pressed in terms of narrowness in the common 
factor space. 



TABLE 2 

ILLUSTRATIVE EXAMPLES OF TRANSFER OF LEARNING 



n 

X 


n 

y 


n - n- 

c f 


k 


k 
n 

X 


n - 

C I 

n 

y 


n - 
. c f 
T, = k 


5 


10 


3 


2 


2/5 


3/10 


3/25 


10 


5 


3 


2 


2/10 


3/5 


3/25 


5 


5 • 


3 


2 


2/5 


3/5 


6/25 


5 


3 


3 


2 


2/5 


3/3 


2/5 


3 


5 


3 


2 


2/3 


3/5 


2/5 


5 


5 


5 


2 


2/5 


5/5 


2/5 


5 


10 


-3 


2 


2/5 


-3/10 


-3/25 
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in ... 

Validation Ejxperiment^^^ 



Since the experiments done by Gengerelli 
(1934), Heinonen (1962), and Melamets^ 
(1965) were not conclusive, the affirmative 
results of the Flammef (1970) experiment 
needed a replication. In preparing this the 
following improvements of the design were 
effectuated at the same time: 

1. Since the posttest correlations seemed 
to be more predictive, a multitude 

of experimental groups with different 
learning times fehould manifest a 
region of degree of learning which 
' would yield the optimally predictive 
correlation coefficient. 

2 . In the former experiment the reliabil- 
ities of the variables were below psy- 
chometric stancjards for reasons of 
drastic testing time reductions. This 
could have been the cause for the 
lack of statistical significance of 

the prediction in using either experi- 
mental or control group pretest corre- 
lations. Adding and substituting 
some transfer tests and reextending 
the testing times should yield higher 
reliabilities . 

3. Since the squaring of the predictor 
correlations gave equivocal results, 
a broader range of the predictor cor- 
relations should allow for clearer 
effects . Broadening the range to an 
interval from r = 0 . 0 to r = . 65 , say , 
was also expected to raise the 
(relative) predictive value. 



iParts of the following results were 
prepared for the diploma-theses of Fldhler 
(1971), Kramis (1971), and Stdckli (1972), to- 
gether y/vith some analyses not covered here. 



Why should it be that posttest correlations 
are more predictive than pretest correlations? 
Although the correction for attenuation does 
eliminate the bias due to unreliability and, 
for a comparison of different tasks, compen- 
sate for unequal lacks of reliability, this 
correction never can raise the reliability it- 
self. The opposite is true: the standard error 
becomes even larger. It is therefore desirable 
in all cases to use variables which are as re- 
liiable as possible. It is common experience 
(Tinkelman, 1971, p. 63) that the reliability 
of tests is best when the item difficulty indices 
are around £= .5 or even a bit higher (Mikkonen, 
11972), since the probability of guessing is 
larger with difficult items than with easy ones. 
In the former experiment as in the one to be 
reported now, for reasons of trainability tests 
were chosen which were originally prepared 
for and normally administered to older students. 
Therefore it could be that the average £'s came 
closer to .5 through training. And the replica- 
tion experiment that included various lengths 
of learning times could therefore eventually 
show a decrease of predictive validity after 
an increase, namely after passing the point 
£= .5. 

Subjects. Three hundred and eight-five 
German- speaking Swiss fifth graders were 
equally distributed among three experimental 
and one control group — E3 , E6, E9 , and C. 

Learning task . Ail the training centered 
on tasks of the type of the test LS (letter series 
to continue), the same as in the earlier experi- 
ment. 

Transfer tasks > Twelve paper-and-pencil 
tests were chosen from Cardinet and Rousson 
(1967, 1968), Amthauer (1963), and Flammer 
et al. (1971). Again, there were several series 
continuation tests with different materials 
such as dominoes, geometric shapes, numerals 
in series, numerals in matrices, and ornaments. 
Other tests were less similar to the learning 
test, like analogies, short-term memory, and 
mathematical operations tests. All the tests 
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were rather difficult for the selected Ss, To 
depress further the probability of ceiling effects, 
all the normal testing times were reduced by 
one- fifth. 

To the extent that the Ss had different mas- 
tery levels on either the learning test and the 
transfer tests or on different transfer tests, the 
dependency of the transfer on the intercorrela- 
tion was expected to be altered according to 
the model proposed here. The same thing would 
be true for different "sizes" of the tasks. It is 
probable that these differences are rather slight, 
at least in the pretests. The following analysis 
will not take into account these eventual differ- 
ences and therefore will represent a (statisti- 
cally conservative) check of just the general 
features of the model. The main reason for this 
is that the practical evaluation of the n's and 
the £'s would require a learning time scale. 
Such a scale would first have to be effectuated 
experimentally--a very time-consuming proce- 
dure. 

Experimental training . The instruction 
was provided exclusively through booklets of i 
Skinner-type linear learning programs prepared ' 
specially for this experiment. The Ss had to 
learn how the letter series were constructed by 
using specially devised graphical means. All 
experimental ^s were presented with each kind 
of problem in the letter-series test. Those with 
extended training received more and increasingly 
difficult examples. 

Transfer scaling . All the pretest distribu- 
tions were normalized, and the scale yielded 
was used to determine the transfer for each test, 
which was defined as mean gain of the respec- 
tive experimental group minus the mean gain of 
the control group. 

Again, the model proposed here would have 
required learning ame scales. Since the sub- 
stitute chosen in this experiment can easily be 
effectuated, it would be very welcome if it 
would work satisfactorily. 

Hypotheses 

The general hypothesis was (1) that the 
regression of the transfer measures on the un- 
squared coefficient of intercorrelation after 
correction for attenuation would contain a sig- 
nificant linear component and nothing but a 
linear component. (2) The predictive value 
should increase with the lengthening of learn- 
ing time and eventually decrease after a certain 
point, for the reasons of reliability mentioned 
above. Certainly, according to the finally 
given formula for the transfer depending on p , 
n^, (Tyy, and cr^, at the same time a regression 
deteriorating effect was expected with the 



growing of the differences between the vari- 
ance of the transfer tests (presumably a^) . 
(3) The regression curve according to the 
model should yield no significant inter -pt, 
i.e. , £(T I r = 0) = 0. 

\ 

Results 

The prediction equations contained all 12 
transfer tests with their average transfer mea- 
sures and their intercorrelations with the train- 
ing test LS. Corresponding to the model, LS 
itself could have been included as a special 
transfer case, where n^^ = ny =i (n^ - n^) , The 
"intercorrelation" would have been equal to 
the reliability, and after correction for attenu- 
ation would equal the value 1.00, Since it was 
felt that relatively high learning effects on the 
training test were to be expected in any case, 
independently from the specific features of the 
model, it would not have been fair to include 
the learning test in the evaluation of the re- 
gression equations. Incidentally, some ex- 
ploratory analyses including the learning tests 
have been made: the results showed indeed 
much stronger overall relationships than the 
ones presented here. 

Since for practical reasons it was impossible 
to raise the number of transfer tests to more than 
12, the degrees of freedom for the prediction 
equation were rather low. A first trend analysis 
was therefore made graphically and by optical 
inspection. None of the scatter plots of the 
many possible combinations showed any ten- 
dency toward a nonlinear trend when the pre- 
dictor correlations were unsquared (FKlhler, 
1971). The further exposition of the results 
can therefore be restricted to the linear cor- 
relations TjQ as indices for predictive validity. 
For reasons which follow the question concerning 
the squaring of the correlation coefficients is 
postponed . 

The reliability coefficients for the unusual 
^-population and the reduced testing times were 
estimated as retest correlations of the control 
group. They varied between .40 and .75. Cor- 
rection for attenuation, not surprisingly, improved 
every prediction. It seems reasonable to include 
here only the results for which the correction 
for attenuation was done. 

Table 3 contains the r.jQ. With very few 
exceptions the r-j^'s are statistically signifi- 
cant, even in the cross-validation cases. Some 
coefficients are of an impressive height, i.e. , 
in the . 80's . 

Apparently the transfer of E6 was best pre- 
dicted with only one exception. At the same 
time the posttest intercorrelations were the 
best predictors, among them the ones stemming 
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TABLE 3 

CORRELATIONS BETWEEN PREDICTOR-CORRELATIONS AND TRANSFER MEASURES 



Predictor stemming from 


Predicted transfer in 
E3 E6 E9 


Average over 
all three groups 


Pretests C 


.58* 


.68* 


.57* 


.61 


E3 


.54* 


.63* 


.61* 


.59 


E6 


.62* 


.80** 


.75** 


. 73 


E9 


.57* 


.59* 


.47 


.55 


Average over pretests (F^q) 


.58 


.69 


.61 


.63 


Posttests C 


.57* 


.63* 


.48 


.57 


E3 


.81** 


.90** 


.82** 


.85 


E6 


.70** 


.88** 


. 86** 


.83 


E9 


.63* 


.58* 


.47 


.57 


Average over posttests (F-jq) 


.69 


.79 


. 70 


.73 


Average over pre- and posttests (F.jq) 


.64 


.74 


.66 


.68 



TjQ averaged on Fisher's z-scale 

*£< .05 (one-tailed) 
**£ < .01 (one-tailed) 



from E3 and E6. These differences, however, 
did not reach statistical significance, since 
the mean square between all the 24 correspond- 
ing Fisher's z,'s divided by the predefined error 
variance of z (= l/^ = l/9) , yielded F < 1 . 
All further statistical analysis of the 2 (pre/ 
posttests) X 4 (intercorrelations out of C, E3/ 
E6, and E9) x 3 (transfer in E3, E6, and E9) 
design seemed unadvisable after this. 

A puzzling characteristic of the results is 
that E9 not only yielded the worst predictors 
and transfer that was less predictable than 
E6/ but the worst predictions of all were those 
to which E9 furnished both transfer and pre- 
dictors , Several ceiling checks disconfirmed 
the ceiling hypothesis, be it for the learning- 
test raw scores or the 12 transfer-test raw 
scores , or even the transfer variance between 
the transfer tests. Since all the testing and 
training was done under the members of the 
research team, any manifest irregularity in 
this could be excluded, except for the obser- 
vation that the teachers of the E9 classes 
towards the end of the experiment began to be 
discontent with the amount of instruction time 
being lost because of the investigation. The 



teachers had been informed in advance of the 
exact times, but some had apparently under- 
estimated the plan. All that can be said in 
regard to E9 is that eventually the teachers' 
attitudes may have confused the Ss who in 
turn >ay then have taken the experiment less 
seric isly . 

'he model's prediction that the transfer 
would linearly depend on the unsquared cor- 
relation coefficient and not on the squared 
one Ads confirmed insofar as the r's yielded 
better predictions than the r^'s in 13 out of 
the 14 cases where E9 vvas involved neither 
in the predictors nor in the transfer measures. 
The 13 cases involving E9 showed the r's to 
be superior only five times. By disregarding 
the rather puzzling E9 group, the model* s pre- 
diction is to be considered as clearly con- 
firmed. 

The third hypothesis was concerned with 
the intercept of the linear regression equations 
expected to be zero. Table 4 shows that they 
were negative without exception. The largest 
departures were those of the cases which 
already had shown the most precise predictions. 
The third hypothesis is clearly to be rejected. 



11 



TABLE 4 

t~VALUES FOR DEPARTURE FROM ZERO OF THE 
INTERCEPTS OF THE LINEAR REGRESSION CURVES 









Predicted transfer in 


Predictor stemming from 


E3 




E6 


E9 


Pretests 


C 


- .72 


-1 


.93 


-1.05 




E3 


- .45 


-1 


.56 


-1.18 




E6 


-1.10 


-3 


.17** 


-2.36* 




E9 


- .73 


-1 


.46 


- .63 


Posttests 


C 


- .58 


-1 


.56 


- .56 




E3 


-2.56 


* -5 


.30** 


-3.08* 




E6 


-1.61 


-4 


.60** 


-3.89** 




E9 


- .97 


-I 


.37 


- .62 



*£ < .05 (two-tailed) 
**£< .01 (two-tailed) 
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IV 

Discussion 



In general , the confirmed predictions are 
not only statistically significant, but also of 
appreciable accuracy. This is true whv^n pre- 
dictors and transfer measures are from the same 
sample, and in the cross-validation case as 
well. Clearly the transfer is proportional to 
the unsquared correlation coefficient, not to 
the squared one . 

In opposition to what was to be expected 
from the model, the regression curve intercepts 
were negative. This had already been found in 
the 1970 experiment. 

Also, in both experiments there were a few 
tests with negative transfer measures . According 
to the model this can only happen if n^ > n^. 
But under the model's assumptions this would 
at the same time cause the correlation to be 
negative, and no non-zero intercept is justifiable 
by this. Negative intercepts need then to be 
caused by either an amount of negative transfer 
on which Ss do not vary or by some additional 
component of positive covariation which is not 
related to transfer. 

The first explanation does not seem to be 
too risky to this author. Standardized tests 
and especially experiments like the one dis- 
cussed here are rather exceptional events in 
Swiss schools. Thus, the unusual circumstances 
and the unusual emphasis of the experimental 
training may well have led all the experimental 
^s to invariably overgeneralize the learned kind 
of solution approach to all the other posttests. 
If this were proven to be true, the negative 
intercept would be typical only for either the 
specially emphatic training or the S sample or 
both. 

The second possible explanation that the 
author sees could call for a slight modification 
of the model. It could be argued that the co- 
variation between two tasks is the sum of two 
components, the one described by the model 
thus far and one produced by individual differ- 
ences in overall information processing rate, 
like personal speed, attitude toward intellectual 
performance, and engagement in the specific 



kind of experimental setup. At least for this 
length of training it seems plausible that such 
a covariation would not have noticeable transfer 
implications. To test this explanatory hypoth- 
esis, one could statistically hold constant 
some general performance variable like mental 
age. 

While both of these proposed explanations 
could account for negative regression intercepts, 
the first is much more likely because of the 
fact that some tests showed a negative trans- 
fer, which may be regarded as rather exceptional 
for the kind of cognitive tasks used. 

One thing this and the former experiment 
showed in comparison with the ones by Heinonen 
(1962) and Melamets^ (1965) is that transfer 
studies with psychometric tasks, which usually 
are chosen for their presumed stability among 
other characteristics, need to be done with 
rather extensive and potent instruction. This 
may be prohibitive for researchers a vj for the 
^s or the persons who decide whether Ss are 
available . 

Certainly the two former experiments also 
worked with the factor analytic angles as pre- 
dictors and not with the correlations. Reanalysis 
of Heinonen's data yielded r^^'s as follows: 
-.04 (pretest intercorrelations^ and .39 (post- 
test intercorrelations) , both corrected for atten- 
uation. The second Tjq is at least encouraging. 
It might have been higher had there been a con- 
trol group allowing subtraction of learning and 
transfer effects not attributable to the experi- 
mental training. The data by Melamets^ are 
not reanalyzable in this context, since the 
intercorrelations were not reported. 

The proposed model is confirmed with the 
presented data although several of its implica- ' 
tions still have not been investigated empiri-- 
cally. Particularly, these are the variations in 
p - t and rij^ ~ n , and also the negative transfer 
case. It is highly desirable to have a set of 
tasks with an empirically normed learning time 
scale. This scale would allow identification 
of p and t as well as n^ and ny — for the latter 
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two at least their relative values, which are the 
only ones that are needed, n^ and n^ never en- 
ter into a formula relating transfer to correlation. 

Finally it might be realized that the pro- 
posed approach as such is not yet concerned 
with individual differences in transfer, but 
uses the existence of individual differences 



in performance in order to get predictors for 
the average transfer. By this the differential 
(correlational) approach and the general (ex- 
perimental) approach (Cronbach, 1957) are 
neither mutually exclusive nor even indepen- 
dent and separate from one another. 
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